A Symbolic Representation Method to Preserve the Characteristic Slope of Time Series
نویسندگان
چکیده
In recent years many studies have been proposed for knowledge discovery in time series. Most methods use some technique to transform raw data into another representation. Symbolic representations approaches have shown effectiveness in speedup processing and noise removal. The current most commonly used algorithm is the Symbolic Aggregate Approximation (SAX). However, SAX doesn’t preserve the slope information of the time series segments because it uses only the Piecewise Aggregate Approximation for dimensionality reduction. In this paper, we present a symbolic representation method to dimensionality reduction and discretization that preserves the behavior of slope characteristics of the time series segments. The proposed method was compared with the SAX algorithm using artificial and real datasets with 1-nearest-neighbor classification. Experimental results demonstrate the method effectiveness to reduce the error rates of time series classification and to keep the slope information in the symbolic representation.
منابع مشابه
A New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining
Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...
متن کاملA new approach to detect Life threatening cardiac arrhythmias using Sequential spectrum of Electrocardiogram signals
This study evaluates the discriminative power of sequential spectrum analysis of the short-term electrocardiogram (ECG) time series in separating normal and subjects with life threatening arrhythmias like, ventricular tachycardia/fibrillation (VT/VF). The raw ECG time series is transformed into a series of binary symbols and the binary occupancy or relative distribution of mono-sequences (i.e. ...
متن کاملA new approach to detect Life threatening cardiac arrhythmias using Sequential spectrum of Electrocardiogram signals
This study evaluates the discriminative power of sequential spectrum analysis of the short-term electrocardiogram (ECG) time series in separating normal and subjects with life threatening arrhythmias like, ventricular tachycardia/fibrillation (VT/VF). The raw ECG time series is transformed into a series of binary symbols and the binary occupancy or relative distribution of mono-sequences (i.e. ...
متن کاملA Sparse Representation Method to Detect Saffron Agricultural Lands Using Sentinel-II Satellite Images Time
Nowadays, agricultural management via remote sensing technology has gained a special position among managers and the people who are in charge of this industry. Saffron (Red Gold) is one of specific Iran’s agricultural products with a high economic valance which is used in different fields of food and medical industries. Considering the cultivation conditions of the saffron, there has not a pers...
متن کاملAlgorithms for Segmenting Time Series
As with most computer science problems, representation of the data is the key to ecient and eective solutions. Piecewise linear representation has been used for the representation of the data. This representation has been used by various researchers to support clustering, classication, indexing and association rule mining of time series data. A variety of algorithms have been proposed to obtain...
متن کامل